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CN . Abstract 

WO. A new proof of the direct part of the quantum channel coding theorem is shown based on a stand- 

point of quantum hypothesis testing. A packing procedure of mutually noncommutative operators is 
carried out to derive an upper bound on the error probability, which is similar to Feinstein's lemma 
in classical channel coding. The upper bound is used to show the proof of the direct part along with 
a variant of Hiai-Petz's theorem in quantum hypothesis testing. 
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O '■ 1 Introduction 
CN 

Let TL be a Hilbert space which represents a physical system of information carrier. We suppose 
dim7Y < oo for mathematical simplicity. Let £(H) be the set of linear operators on TL and define the 
fS |' totality of density operators on TL by 

S(TL) d = f {p e C{H) | p = P * > 0, Tr[p] = 1 } . (1) 

03 

We will treat a quantum channel defined by a mapping x € X i — > p x G S(TL), where X is a finite set of 
input alphabets and each p x represents the quantum state of the output signal. 

An encoding-decoding system of the message over the n-th extension of the channel is described 
as follows. Each message k G {1, ...,M n } is encoded to a codeword u = x\...x^ l in a codebook 
5_i ■ C n — {u 1 , . . . , u Mn } C X n by an encoder, where X n is the n-th direct product of X, and the codeword 

u k is mapped to p u k — p x k <E> • • • <8> p x n € S(H® n ) through the channel. The decoding process, which 
is called a decoder, is represented by a set X n = {Xi,X%, . . . , Xm„} of nonnegative operators on Tt® n 

satisfying X)fe=i < Im implying that X n with Xq = f /„ — Sfc^i -^k becomes a quantum measurement 
on TL® n taking its value in {0, 1, . . . , M n }. A pair of encoding and decoding processes (C™, X n ) is called 
a code with cardinality M n or with transmission rate R n = log M n /n. 

The probability that the decoder outputs a message k when a message / is sent is given by Tr[p u iXi~\. 
Thus, assuming that all messages arise with the uniform probability, the average error probability of the 
code (C n ,X n ) is given by 

1 M n 

Pe (C», X n ) d =l f — (1 - Tr[/, u ^]) . (2) 

k— 1 
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Our interest lies in asymptotically achievable transmission rates with arbitrarily small error. The channel 
capacity is defined as the supremum of those values: 



C d =sup(i? 3 {(C n , X n )}°°_, such that lim Pe (C™, X n ) = and liminf - \ogM n > Ft\. 



(3) 



In order to describe the quantum channel coding theorem, let us introduce the quantum mutual 
information Jlj] as follows. Let V(X) be the totality of the probability distributions on X, and let 

a p = ^2 x£X p{x)p x be the mixture state by some p G V(X). Then the quantum mutual information is 
defined by 



x£X 

= Y J P{*)D{p x \Wp), (4) 



x£X 

where H(p) d = — Tr[plog/3] is the von Neumann entropy and -D(p||<r) ^ f Tr[p(logp— logo - )] is the quantum 
relative entropy. One of the most significant theorems in quantum information theory is the quantum 
channel coding theorem: 

C= max Hp). (5) 

vev(x) 

The theorem is composed of two inequalities: the direct part (>) and the converse part (<). The 
direct part is given by showing the existence of a good code, and was established in the middle of the 
1990's by Holevo [3] and independently by Schumacher and Westmoreland || after the breakthrough by 
Hausladen et al. 3. The classical counterpart of the method used there is thought as a variant of the 
joint typical decoding || along with the random coding technique. On the other hand, the converse part, 
concerned with nonexistence of too good code, goes back to 1970's works by Holevo |Q ||. In 1999, 
another proof of the direct part was also given by Winter , in which he developed the method of type 
H |9) in the quantum setting followed by a greedy construction of a good code. 

In classical channel coding, the most transparent proof of the direct part is thought to be Feinstein's 
proof JhJ (see also (l^l), the essence of which is described below. Let w n (y n \x n ) be a channel matrix 
transmitting an input sequence x n = xi . . . x n G X n to an output sequence y n = y\ . . . y n G y n , where 
X n and y n are the n-th direct products of the sets X and y of input alphabets and output alphabets, 
respectively. The channel is called stationary memoryless if w n (y n \x n ) = w(yi\xi) . . .w(y n \x n ) holds 
with a one-shot channel w(y\x) (x G X, y G y). If the input X n — X\ . . . X n is a random variable taking 
its value in X n subject to a probability distribution p n {x n ) (x n G X n ), then the output Y n = Y\ . . . Y n 
and the pair (X n 7 Y n ) also become random variables subject to q n {y n ) = J2 x ™ex™ P n {x n )w{y n \x n ) and 
p n (x n )w n (y n \x n ), respectively. In the first part of Feinstein's proof, it is shown that if we have 

f 1 w n (Y n \X n ) 1 
Pr {n lQg V") ^ Q }^° (6) 

for a real number a, then we can construct a reliable code with transmission rate arbitrarily near a. In 
the second part, assuming that p n (x n ) is the independently and identically distributed (i.i.d.) extension 
of a probability distribution p(x) (x G X) and w n {y n \x n ) is a stationary memoryless channel, (||) is shown 
for a < I(X;Y) by the law of large numbers, where I(X;Y) is the classical mutual information for the 
random variables (X, Y) subject to p(x)w(y\x). As for the first part, no assumption such as the stationary 
memoryless property for the channel is needed, which led to a general formula for the classical channel 
capacity by Verdii-Han jl3| as one of the landmarks in the information-spectrum method |l4[ ] . As ways 
of providing the first part, two different methods are known so far; the random coding technique and the 
packing algorithm, both of which give us an important insight into the construction of good codes. 
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In this paper [] a new proof of the direct part for the stationary memoryless quantum channel is 
shown based on a limiting theorem |l6| in quantum hypothesis testing, which is regarded as a variant of 
Hiai-Petz's theorem |17[ , combined with a packing procedure for operators following Winter Q to obtain 
a good code. The limiting theorem in quantum hypothesis testing is thought to be a substitute for the 
law of large numbers used to show in classical information theory. The approach used here is regarded 
as an attempt to develop the information-spectrum method JT^ ] in quantum channel coding, which was 
followed by Hayashi-Nagaoka [ p"8| with further developments using the random coding technique to obtain 
a general formula for the channel capacity in the quantum setting. It should be noted here, however, any 
packing algorithm to derive the general formula is not established yet. 

2 Relation with Hypothesis Testing 

In the sequel, a p — ^2 x< zxP( x )P x ^ s written as a omitting the subscript p when no confusion is likely 
to arise. Let us define block diagonal matrices f| 



» def 
P = 







P{x)Px 







\ 



dcf 







p(x)a 







(7) 



which are density operators in S ((§) x( zx H) and denoted as 



P = ®v{x)Px, 

x£X 



(8) 



x£X 



respectively. It is important to note that the quantum mutual information is nothing but the quantum 
relative entropy between p and a, i.e., D(p\\a) = I(p). In the same way the n-th tensor powers of p and 
<7 are given by the following block diagonal matrices 



(9) 



S (© x n e ^n W®"), respectively, where we used the following notations for each x 11 — x\Xi . . .x n € X n 

p n (x n ) d ^p(x 1 )p(x 2 )...p(x n ), (10) 

'•®Px n - (11) 



def 

Px™ = P X1 <S> Px 2 



Here, let 5 5 .®n(p® n ) be the pinching defined in Appendix Applying Lemma |^ in Appendix ^ induc- 
tively, we can show that 



(12) 



In order to relate the above observation to the channel coding theorem, let us introduce the quantum 
hypothesis testing problem here, which is explained concisely in Appendix [b|, and examine the error 
probability of a test as follows. Given a Hermitian operator X — J2i x iEi, define the projection {X > 0} 

by 



{X > 0} d =i R ' 



(13) 



i:xt>0 



1 The prototype of the results was given in pa] . 

2 The extensions of the density operators usea here were considered by JlQl in an attempt to relate Hiai-Petz's theorem 
to channel coding. 
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With the above notation, we define a test for the hypotheses p® n and a® n as 

S n (a) d = { E^ n {p® n ) ~ e na a® n > 0} 

= {^(p x -)-e no (T a,n >0}, (14) 

where a is a real parameter and the last equality follows from ( |i~2| ) . The error probability of the first kind 
for the test is written as follows by using the notation ~p x „ d = £ a ® n (Px n ) 

&n (S n (a)) = Tr [p® n (l„ - S B (o))" 

= P n {x n )Tt[p xn {p xn -e na a^<Q}), (15) 

which tends to zero exponentially if a < D(p\\a) by Lemma |] in Appendix |b] ]l6| . Thus we have shown 
the following lemma as an analogue of (^3j) in quantum channel coding. 

Lemma 1 For^a < I(p), we have 

hm V p"(x")Tr [p xn {p xn - e"V*> < 0}] = 0. (16) 



3 Greedy Construction of a Code 

In the previous section, we have shown the achievability of the quantum mutual information in the 
sense of Lemma |l|. Needless to say, this does not directly mean the achievability concerned with the 
error probability nor the transmission rate, since the tests {p x n — e™ a er® n > 0} for different x n do not 
commute each other and can not be realized simultaneously on W® n . In this section we will give a packing 
procedure of the tests to meet the condition of the quantum measurement on Tt® n , which leads to the 
following lemma. 

Lemma 2 Let x € X i — > p x G S{TL) be a quantum channel, and let a = f ^2 xeX p(x)p x be the mixture 
state by a probability distribution p £ V(X). Suppose that the following condition holds for the pinching 

p x d = £a(px) with some real numbers S > and c > 0: 

^p(x)Tr[fefe-ca>0}]>l^. (17) 

xGX 

Then, for any real numbers 7 > and r\ > 0, there exist a codebook C — {uk}^ =1 C X with cardinality 
M and a decoder X = {Xk}^- 1 on Ti such that 

T M 

min{ry,l-(5-7-20?}< — , (18) 



7 + 5 ' v c 

M 



l -Y J {l-Tv[p Uk X k ])<5 + 1 + 2^]. (19) 



M 



k=l 



Before proceeding to the proof of Lemma ||, let us consider the n-th extension of the lemma replacing 
the symbols above as follows: 

#<— Px ^p x ~, P ^ P n , o-^a® n , c^e na , 7 <— 7 „, V *— e~ nX , (20) 
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where a is a real number and A > is an arbitrarily small number. Letting p x n = £ a ® n {Px n )i we can 
easily see that ( |l7| ) is satisfied by setting the following 6 n (a) in place of 6, 



J„(«) = E P"(^)Tr[p x „{p x „-e"V^<0}]. (21) 
Therefore we obtain the following theorem using Lemma 0. 



Theorem 1 With the above notation, there exist a codebook C n = «, c X n with cardinality M. : 

an 

1; 



id a decoder X n = {Xfc}^ on U® n such that 



.. -in (e-"\ 1 - 5„(a) - 7 n - 2V^| < e -" a M„, (22) 
7„ + d„(o) L J 

Pe (C™, X") < «J„(a) + 7„ + 2Ve-" A . (23) 

For any a < /(p), we have lim„^ 00 <5„(a) = by Lemma [j], and hence we can choose 7n such that 



linin—xxj 7„ = and liirin^oo — ? ; = 1. Therefore the above theorem yields 



liminf-logM„ > a, (24) 
lim Pe(C n ,X n ) = 0, (25) 

n — >oc 

since A > can be arbitrarily small. Thus we have given a new proof of the direct part of the quantum 
channel coding theorem except that the proof of Lemma || remains to be shown. 

Proof of Lemma [|: To begin with, let us define a set of candidates for codewords by 

X' {x € X \ Tr[p x {p x - ca > 0}] > 1 - 8 - y} . (26) 
Then the probability of X' is bounded below as 

ft' 1 + 6 

which is verified as follows: 

l-*< J2p(x)Tr[p x {p x -ca>0}] 

xGX 

= p( x ) Ti 'Ipx iPx - c ° > °}] + p( x ) Ti 'Ipx {~Px~ c(j > o}] 

xEX' x£X\X' 

<p(X') + (l~p(X'))(l-S~~f). (28) 

Utilizing the normalization technique for operators developed by Winter (see the proof of Theorem 10 in 
0), a codebook C = {uk]^f-i Q X' and a decoder X — {Xk}^_ 1 are constructed by the following greedy 
algorithm, along with the operators Sk S £{H) (A = 0, 1, ... , M) for normalization. 

(a) Let S = Q. 

(b) Repeat the following procedures for k = 1, 2, 

(b-1) If there exists an a; € X'\{u±, . . . , Uk-i} such that Tr\p x Sk-i] < ?7, then choose such an x 
arbitrarily and define 

dcf 
Uk = x, 



X k d = y/T- Sk-i {p Uk - co > 0}y/l - Sk-i, 

else go to (c). 



Sk = Sk-i + Xk, 
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(b-2) Let k < — k + 1 and go back to (b-1). 

(c) Letting M< — k,C d = {ufcjfeli and X d = {X k }%L v end the algorithm. 

It should be noted here that each of X k and S k commutes with cr, and each S k satisfies the following 
inequalities 



< S k < I, 

which follows from 

< {p x - ca > 0} < I 
and induction. Moreover it holds that for Vx 6 X'\C 

Tt[p x S m ] > V, 



(29) 



(30) 



(31) 



while for any codeword u k G C we have Tr[p Uk S k -i] < rj. Thus, applying Winter's gentle measurement 
lemma (Lemma 9 in Q, see Appendix ^]), we obtain 



Pu k - V I - s k-iPu k S k -i < 



(32) 



Therefore, using the property of the pinching (|4lj), we have 
Tr[p Uk X k ] = Tr[p Uk X k ] 

V 1 ~ S k-lPu k V 1 ~ S k-1 {Pu k ~ ca > °} 



Tr 
Tr 



-Tr 



Pu k {Pu k -C<7 > 0} 

>Tr[p llfc {p Uk ~ca>0}]~2^j 
> 1 - 6 - 7 - 2^rj, 



\Pu k - V 1 - Sk-iPu k \Jl- Sk-ij {p Uk - ca > 0} 



(33) 



where the last inequality follows from u k £ X' , and hence 



Tr[p u , S M ] = Tr 



Put (^Yl X kj 



> Tr[p uk X k ] 

> 1 - 5 - 7 - 207- 



(34) 



Now, in order to evaluate the cardinality of the code M, we will estimate the lower and upper bound of 
Tr[aS M }- Using (|l|) and (||), the lower bound is obtained as 



Ti[aS M } = ^2 p(x)Tr[p x S M ] 

> p( x ) Tt Ipx s m] 

> p(x) min {rj, 1 — S — 7 — 2^/rj} 



x(£X> 



> 



7 + 5 



-5-7- 20?}, 



(35) 



G 



where the last inequality follows from (|27|). On the other hand, Tr[(T5ji/] is bounded above as 

M 

Tr[aS M } = T ^ X k] 

M 

= X) ^ [V 7 - Sk-i - S k -i {p Uk - ccx > 0} 



fe=l 

M 



<E Tr ^{P« fc -ca>0}] 



(36) 



fc=i 



since we have \/I — Sk-i cry/ 1 — Sk-i < c, which follows from ( p^ ) and the fact that each \/T— Sk-i 
commutes with a. Moreover, (pm is bounded above further as 



M 

Tr[aS M ]<J2-^[Pu k {p Uh -ca>0}} 



< 



k=l 

M 



(37) 



by the definition of {p Ul — ccr > 0V Now the assertion (|lj) is shown by combining the lower bound (35) 
with the upper bound (p7|), and ( |l9| ) follows from (|3^). ■ 



4 Concluding Remarks 

We have revisited the direct part of the quantum channel coding theorem j2| || considering the 
relation with the hypothesis testing problem, and an upper bound on the error probability similar to 
Feinstein's lemma [[l0| has been obtained. The approach used here is regarded as an attempt to develop 
the information-spectrum method Jl3| in quantum channel coding. 



Appendices 

A A property of the pinching 

We summarize a property of the pinching related to a *-subalgebra in C(TL) for readers' convenience. 
In the sequel, we denote the *-algebra generated by operators {/, Ai, A2, . . ■ } C C{TL) as \A\, A2, ■ ■ .]. 
Given a Hermitian operator A G £(H), let 

V 

A = Y,aiEi (38) 
»=i 

be its spectral decomposition, where each Ei is the projection corresponding to an eigenvalue ai different 
from others and v is the number of the eigenvalues. Then we can easily see that {A} = [Ei, . . . ,E v j 
holds. Actually, it is clear that we have the inclusion [A] C \E\, . . . , E V J from (|38|). On the other hand, 
considering the following function for each i, 

/<(a .)^ Jj£ZfL, (39) 
we have Ei = fi(A), which yields the converse inclusion [A] D [J5i, . . . ,E v j. 
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Let us denote the commutant of {AJ as 

{Af d = {B e C(H) | vc e [A], BC = CB} 
= {BeC{H) | BA = AB} 

= {BeC(H) \BE i =E i B(i = l,...,v)}. (40) 

Then the pinching £a ■ C(7i) — > \A\' C £(H.) is dehned as the projection of an operator to the *- 
subalgebra {Af so that 

WCe [A]', ((B,C)) = ((£ A (B),C)) (41) 

holds for VB e C(H), where 

((A, B)) d = Tr[A*B] (42) 
is the Hilbert-Schmidt inner product. Note that the pinching can be written explicitly as 



£ A {B) ^Y.EiBEi. (43) 

i=l 

We used the following property of the pinching to show (|l2] ) in the section 0. 

Lemma 3 Let A,B,C,D e C{TL), and define block diagonal matrices onH. Q)H. by X d = A® B and 
Y = C®D with the same notation as used in the section [|. Then we have 

£ Y {X)=£c(A)®£ D (B). (44) 

Proof: For any operator Z = ( f 11 f 12 ) e lYf, ZY = YZ yields Z u e ICf and Z 22 S WV by a 
direct calculation. Thus, we have 

Tr[XZ] = Tr[AZ n ] + Tr[BZ 22 ] 

= Tt[£ c (A)Z 11 ] + Ty[£ d (B)Z 22 ] 

= Tr [(£c(A)®£ D (B))Z}. (45) 
Now the assertion has been proved, since £c(A) © £d{B) € \Y\' is obvious and Z e [Y]' is arbitrary. ■ 

B Hiai-Petz's Theorem and its variants 

In this appendix we summarize Hiai-Petz's theorem jlTj] and its variants |l6| in quantum hypothesis 
testing, which is used to show Lemma [l]. Given p and a in S(Tl), let us consider the hypothesis testing 
problem about hypotheses H : p® n e S{H® n ) and H 1 : a® n e S{l-l® n ). The problem is to decide 
which hypothesis is true based on a two- valued quantum measurement {Xq,Xi} on 7i® n , where the 
subscripts and 1 indicate the acceptance of Hq and Hi, respectively. In the sequel, an operator 
A n e £(H® n ) satisfying inequalities < A n < I n is called a test, since A n is identified with the 
measurement {A n , I n — A n }. For a test A n , the error probabilities of the first kind and the second kind 
are, respectively, defined by 

a n (An) d ^ Tr[p® n (I n - A n )}, (3 n (A n ) d ^ f Tr[a® n A n ]. (46) 

For < Ve < 1, let us define 

P*(e) d = min{/3„(A„) | A n : test, a n (A n ) < e} , (47) 
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and recall the quantum relative entropy: 



D(p\\a) d ^TT[p(logp-loga)}. (48) 
Then we have the following theorem, which is called the quantum Stein's lemma: 

lim -log /3*(e) = -D(p\\a). (49) 

n—>oo Tl 

The first proof of ( f49| ) was shown by two inequalities. One is the direct part given by Hiai-Petz ]l7[ : 

limsupilog/^e) <-D(p\\a), (50) 

n — >oc TL 

and the other is the converse part given by Ogawa-Nagaoka [^o| . 

Preceding the direct part ( |50|) , Hiai-Petz (lTj] proved the following theorem 

D{p\\&) = lim -sup£> M „ (p® n \\*® n ) , (51) 

rwoo n M n 

where the supremum is taken over the set of quantum measurements on H® n , and Dm„ (p®™||<7 <gln ) is the 
classical relative entropy (Kullback divergence) between the probability distributions {Tr[p®"M ra ^]} and 
{Trfcr^Mn^]}. Note that the monotonicity of the quantum relative entropy ^|) Q yields D(p\\o) > 
1/n ■ Dm„ (p* 1 ™!^®™) for any measurement M n , and in addition there exists a measurement that attains 
the equality if and only if p and a mutually commute. Hiai-Petz combined ( |5l| ) with the classical 
hypothesis testing problem to show the direct part (|50|). 

In recent developments |2l| jl6) , we found direct proofs of ( f50| ) without using the achievability of the 



information quantity (pl[). Here, we will make use of a simple test defined below with the notation (13) 
and the pinching £ CT « n (p®") (see Appendix |X|): 

S n {a) = { £ a ® n (p® n ) - e na a® n > 0} , (52) 
which satisfies the following lemma (see Theorem 2 in fl6|). 

Lemma 4 (Ogawa-Hayashi) For < Vs < 1, we have 

On (Sn(a)) < ( n + iy^-H e n[as-i,{s)]^ (53) 

Pn (S n (a)) < e~ na , (54) 

where 

^(s) d = -logTr [paip-'tri] . (55) 



Observing that ip(0) = and ^'(0) = D(p\\a), we can show that for Va < D(p\\a) 

0n (S n (a)) < e- na . (56) 

C Winter's Lemma 

In this appendix Winter's gentle measurement lemma (Lemma 9 in 0) is explained. The original 
proof of the lemma by Winter is based on a study of the relation between the trace norm distance and 
the fidelity. Here, a direct proof of the lemma is given for readers' convenience accompanied by a little 
improvement of the constant in the upper bound. 



lim a n (S n (a)) = 0, 

n — >oo 
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Lemma 5 (Winter) For Vp € S(Tt) and VX £ C(J~l) satisfying inequalities < X < I, we have 

p- VXpVx i < 2y/Tr[p(I - X)]. (57) 

Proof: A direct calculation yields 

p - VXpVx = ( I - Vx + Vx) p - Va>VT 



< 



i-Vx) P + Vxp(i~Vx 



Tr 



(l - \/X^j VP • Vp| + Tr | VXy/p ■ ^fp~(l - \fX 



< WTr 



p(i-Vx 



Tr[p}+JTr[pX}-Tr 



p(i-Vx 



<2y/Tr\p(I-X)], 



(58) 

(59) 
(60) 



where ( p8[ ) follows from the triangle inequality, ( p9| ) follows from the Cauchy-Schwartz inequality for 
operators, and we used (1 — \/~x) 2 < (1 — x) for < x < 1 and Tr[pX] < 1 in (p 
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